44 research outputs found

    Bit-Vector Search Filtering with Application to a Kanji Dictionary

    Get PDF

    Aspects of Metric Spaces in Computation

    Get PDF
    Metric spaces, which generalise the properties of commonly-encountered physical and abstract spaces into a mathematical framework, frequently occur in computer science applications. Three major kinds of questions about metric spaces are considered here: the intrinsic dimensionality of a distribution, the maximum number of distance permutations, and the difficulty of reverse similarity search. Intrinsic dimensionality measures the tendency for points to be equidistant, which is diagnostic of high-dimensional spaces. Distance permutations describe the order in which a set of fixed sites appears while moving away from a chosen point; the number of distinct permutations determines the amount of storage space required by some kinds of indexing data structure. Reverse similarity search problems are constraint satisfaction problems derived from distance-based index structures. Their difficulty reveals details of the structure of the space. Theoretical and experimental results are given for these three questions in a wide range of metric spaces, with commentary on the consequences for computer science applications and additional related results where appropriate

    Approximate Furthest Neighbor in High Dimensions

    Get PDF
    Much recent work has been devoted to approximate nearest neighbor queries. Motivated by applications in recommender systems, we consider approximate furthest neighbor (AFN) queries. We present a simple, fast, and highly practical data structure for answering AFN queries in high-dimensional Euclidean space. We build on the technique of Indyk (SODA 2003), storing random projections to provide sublinear query time for AFN. However, we introduce a different query algorithm, improving on Indyk's approximation factor and reducing the running time by a logarithmic factor. We also present a variation based on a query-independent ordering of the database points; while this does not have the provable approximation factor of the query-dependent data structure, it offers significant improvement in time and space complexity. We give a theoretical analysis, and experimental results

    Visualizing Cancer

    Get PDF
    Imaging has had a profound impact on our ability to understand and treat cancer. We invited some experts to discuss imaging approaches that can be used in various aspects of cancer research, from investigating the complexity and diversity of cancer cells and their environments to guiding clinical decision-making

    On k-enclosing objects in a coloured point set

    Get PDF
    We introduce the exact coloured k -enclosing object problem: given a set P of n points in R 2 , each of which has an associated colour in f 1 ;:::;t g , and a vec- tor c = ( c 1 ;:::;c t ), where c i 2 Z + for each 1 i t , nd a region that contains exactly c i points of P of colour i for each i . We examine the problems of nd- ing exact coloured k -enclosing axis-aligned rectangles, squares, discs, and two-sided dominating regions in a t -coloured point setPostprint (published version
    corecore